智能论文笔记

Hybrid Quantum-Classical Generative Adversarial Network for High Resolution Image Generation

Shu Lok Tsang , Maxwell T. West , Sarah M. Erfani , Muhammad Usman

分类：计算机视觉 | 机器学习

2022-12-22

Quantum machine learning (QML) has received increasing attention due to its potential to outperform classical machine learning methods in various problems. A subclass of QML methods is quantum generative adversarial networks (QGANs) which have been studied as a quantum counterpart of classical GANs widely used in image manipulation and generation tasks. The existing work on QGANs is still limited to small-scale proof-of-concept examples based on images with significant down-scaling. Here we integrate classical and quantum techniques to propose a new hybrid quantum-classical GAN framework. We demonstrate its superior learning capabilities by generating $28 \times 28$ pixels grey-scale images without dimensionality reduction or classical pre/post-processing on multiple classes of the standard MNIST and Fashion MNIST datasets, which achieves comparable results to classical frameworks with 3 orders of magnitude less trainable generator parameters. To gain further insight into the working of our hybrid approach, we systematically explore the impact of its parameter space by varying the number of qubits, the size of image patches, the number of layers in the generator, the shape of the patches and the choice of prior distribution. Our results show that increasing the quantum generator size generally improves the learning capability of the network. The developed framework provides a foundation for future design of QGANs with optimal parameter set tailored for complex image generation tasks.

translated by 谷歌翻译

Efficient brain age prediction from 3D MRI volumes using 2D projections

Johan Jönemo , Muhammad Usman Akbar , Robin Kämpe , J Paul Hamilton , Anders Eklund

分类：计算机视觉 | 机器学习

2022-11-10

Using 3D CNNs on high resolution medical volumes is very computationally demanding, especially for large datasets like the UK Biobank which aims to scan 100,000 subjects. Here we demonstrate that using 2D CNNs on a few 2D projections (representing mean and standard deviation across axial, sagittal and coronal slices) of the 3D volumes leads to reasonable test accuracy when predicting the age from brain volumes. Using our approach, one training epoch with 20,324 subjects takes 40 - 70 seconds using a single GPU, which is almost 100 times faster compared to a small 3D CNN. These results are important for researchers who do not have access to expensive GPU hardware for 3D CNNs.

translated by 谷歌翻译

Does an ensemble of GANs lead to better performance when training segmentation networks with synthetic images?

Måns Larsson , Muhammad Usman Akbar , Anders Eklund

分类：计算机视觉

2022-11-08

Large annotated datasets are required to train segmentation networks. In medical imaging, it is often difficult, time consuming and expensive to create such datasets, and it may also be difficult to share these datasets with other researchers. Different AI models can today generate very realistic synthetic images, which can potentially be openly shared as they do not belong to specific persons. However, recent work has shown that using synthetic images for training deep networks often leads to worse performance compared to using real images. Here we demonstrate that using synthetic images and annotations from an ensemble of 10 GANs, instead of from a single GAN, increases the Dice score on real test images with 4.7 % to 14.0 % on specific classes.

translated by 谷歌翻译

MEDS-Net: Self-Distilled Multi-Encoders Network with Bi-Direction Maximum Intensity projections for Lung Nodule Detection

Muhammad Usman , Azka Rehman , Abdullah Shahid , Siddique Latif , Shi Sub Byon , Byoung Dai Lee , Sung Hyun Kim , Byung il Lee , Yeong Gil Shin

分类：计算机视觉

2022-10-30

In this study, we propose a lung nodule detection scheme which fully incorporates the clinic workflow of radiologists. Particularly, we exploit Bi-Directional Maximum intensity projection (MIP) images of various thicknesses (i.e., 3, 5 and 10mm) along with a 3D patch of CT scan, consisting of 10 adjacent slices to feed into self-distillation-based Multi-Encoders Network (MEDS-Net). The proposed architecture first condenses 3D patch input to three channels by using a dense block which consists of dense units which effectively examine the nodule presence from 2D axial slices. This condensed information, along with the forward and backward MIP images, is fed to three different encoders to learn the most meaningful representation, which is forwarded into the decoded block at various levels. At the decoder block, we employ a self-distillation mechanism by connecting the distillation block, which contains five lung nodule detectors. It helps to expedite the convergence and improves the learning ability of the proposed architecture. Finally, the proposed scheme reduces the false positives by complementing the main detector with auxiliary detectors. The proposed scheme has been rigorously evaluated on 888 scans of LUNA16 dataset and obtained a CPM score of 93.6\%. The results demonstrate that incorporating of bi-direction MIP images enables MEDS-Net to effectively distinguish nodules from surroundings which help to achieve the sensitivity of 91.5% and 92.8% with false positives rate of 0.25 and 0.5 per scan, respectively.

translated by 谷歌翻译

Image Quality Assessment for Foliar Disease Identification (AgroPath)

Nisar Ahmed , Hafiz Muhammad Shahzad Asif , Gulshan Saleem , Muhammad Usman Younus

分类：计算机视觉

2022-09-26

作物疾病是对粮食安全的主要威胁，其快速识别对于防止产量损失很重要。由于缺乏必要的基础设施，因此很难迅速识别这些疾病。计算机视觉的最新进展和智能手机渗透的渗透为智能手机辅助疾病识别铺平了道路。大多数植物疾病在植物的叶面结构上留下了特定的文物。这项研究于2020年在巴基斯坦拉合尔工程技术大学计算机科学与工程系进行，以检查基于叶片的植物疾病识别。这项研究为叶面疾病鉴定提供了基于神经网络的深度解决方案，并纳入了图像质量评估，以选择执行识别所需质量的图像，并将其命名为农业病理学家（AGRO PATH）。新手摄影师的捕获图像可能包含噪音，缺乏结构和模糊，从而导致诊断失败或不准确。此外，Agropath模型具有99.42％的叶面疾病鉴定精度。拟议的添加对于在农业领域的叶面疾病鉴定的应用特别有用。

translated by 谷歌翻译

Face Anti-Spoofing from the Perspective of Data Sampling

Usman Muhammad , Mourad Oussalah

分类：计算机视觉

2022-08-28

如果不部署面部反动体相反的对策，则可以通过呈现印刷照片，视频或真实用户的硅面膜来欺骗面部识别系统。因此，面对表现攻击检测（PAD）在提供对数字设备的安全访问方面起着至关重要的作用。大多数现有的基于视频的垫子对策都无法应对视频中的远程时间变化。此外，在特征提取步骤之前的键框采样尚未在面部抗散热域中广泛研究。为了减轻这些问题，本文通过提出一种视频处理方案来提供一种数据采样方法，该方案基于高斯加权功能进行建模远程时间变化。具体而言，提出的方案将视频序列的连续T帧编码基于T帧的高斯加权总和，将视频序列的连续T帧编码为单个RGB图像。仅使用数据采样方案，我们证明可以在三个公共基准数据集的数据库内和数据库间测试方案中没有任何铃铛和哨子来实现最先进的性能；即，重播攻击，MSU-MFSD和CASIA-FASD。特别是，与跨数据库情景中的基线相比，该计划的误差（CASIA-FASD的15.2％至6.7％，重播攻击的5.9％至4.9％）。

translated by 谷歌翻译

HTML版本

Self-Supervised Face Presentation Attack Detection with Dynamic Grayscale Snippets

Usman Muhammad , Mourad Oussalah

分类：计算机视觉

2022-08-27

面部表现攻击检测（PAD）在防御面部识别系统免受演示攻击方面起着重要作用。 PAD的成功很大程度上依赖于需要大量标记数据的监督学习，这对于视频尤其具有挑战性，通常需要专家知识。为了避免昂贵的标记数据收集，本文提出了一种通过运动预测进行自我监督视频表示学习的新方法。为了实现这一目标，我们基于三个RGB框架利用时间一致性，这些RGB帧在视频序列中以三个不同的时间获取。然后将获得的帧转换为灰度图像，其中每个图像被指定为三个不同的通道，例如R（红色），G（绿色）和B（蓝色），形成动态灰度片段（DGS）。由此激励，这些标签会自动生成，以通过使用视频的不同时间长度来基于DG的时间多样性，这对下游任务非常有帮助。从我们方法的自我监督性质中受益，我们报告了结果，结果表明，在四个公共基准数据集上的现有方法，即重播攻击，MSU-MFSD，CASIA-FASD和OULU-NPU。解释性分析是通过石灰和Grad-CAM技术进行的，以可视化DGS中使用的最重要功能。

translated by 谷歌翻译

An Overview of Structural Coverage Metrics for Testing Neural Networks

Muhammad Usman , Youcheng Sun , Divya Gopinath , Rishi Dange , Luca Manolache , Corina S. Pasareanu

分类：人工智能 | 机器学习

2022-08-05

深度神经网络（DNN）模型，包括在安全 - 关键域中使用的模型，需要进行彻底测试，以确保它们在不同的情况下可以可靠地表现良好。在本文中，我们提供了用于测试DNN模型的结构覆盖量指标，包括神经元覆盖（NC），K-Multisection神经元覆盖范围（KMNC），TOP-K神经元覆盖范围（TKNC），神经元边界覆盖率（NBC），强元（NBC），强神经元激活覆盖范围（SNAC）和修改条件/决策覆盖范围（MC/DC）。我们评估用于感知任务的现实DNN模型（包括LENET-1，LENET-4，LENET-5和RESNET20）以及自治（TAXINET）中使用的网络的指标。我们还提供了一个工具DNNCOV，可以测量所有这些指标的测试覆盖范围。 DNNCOV向研究人员和从业人员提供了一份信息丰富的报道报告，以评估DNN测试的充分性，比较不同的覆盖范围，并在测试过程中更方便地检查模型的内部。

translated by 谷歌翻译

Automated Quantum Circuit Design with Nested Monte Carlo Tree Search

Pei-Yong Wang , Muhammad Usman , Udaya Parampalli , Lloyd C. L. Hollenberg , Casey R. Myers

分类：机器学习

2022-07-01

基于变异方法的量子算法是构建量子溶液的最有前途的方法之一，并在过去几年中发现了无数的应用。尽管具有适应性和简单性，但它们的可扩展性和选择合适的ATZ的选择仍然是主要的挑战。在这项工作中，我们报告了基于嵌套的蒙特卡洛树搜索（MCTS）的算法框架，并与组合多部队的bastit相结合（ CMAB）模型，用于量子电路的自动设计。通过数值实验，我们证明了应用于各种问题的算法，包括量子化学中的地面能量问题，在图上进行量子优化，求解线性方程的系统，并找到编码编码与现有方法相比，用于量子误差检测代码的电路，结果表明我们的电路设计算法可以探索更大的搜索空间并优化较大系统的量子电路，从而显示出多功能性和可扩展性。

translated by 谷歌翻译

Performance analysis of coreset selection for quantum implementation of K-Means clustering algorithm

Fanzhe Qu , Sarah M. Erfani , Muhammad Usman

分类：机器学习

2022-06-16

预计量子计算将提供巨大的计算能力，可以为许多数据科学问题提供有效的解决方案。但是，当前一代的量子设备很小且嘈杂，这使得处理与实际问题相关的大数据集变得困难。核心选择旨在通过减少输入数据的大小而不损害准确性来避免此问题。最近的工作表明，核心选择可以帮助实施量子K-均值聚类问题。但是，尚未探索核心选择对量子K-均值聚类性能的影响。在这项工作中，我们比较了两种核心技术（BFL16和Oneshot）的相对性能以及每种情况下的核心结构的大小，相对于各种数据集，并布局在实现量子算法中的核心选择的优势和局限性。我们还研究了去极化量子噪声和位叶片误差的影响，并实施了量子自动编码器技术以超过噪声效应。我们的工作为未来在近期量子设备上实施数据科学算法提供了有用的见解，这些量子设备通过核心选择减少了问题大小。

translated by 谷歌翻译